Tracking Revisited using RGBD Camera: Baseline and Benchmark

Authors

  • Shuran Song
  • Jianxiong Xiao
Abstract

Although there has been significant progress in the past decade, tracking is still a very challenging computer vision task, due to problems such as occlusion and model drift. Recently, the increased popularity of depth sensors (e.g. Microsoft Kinect) has made it easy to obtain depth data at low cost. This may be a game changer for tracking, since depth information can be used to prevent model drift and handle occlusion. In this paper, we construct a benchmark dataset of 100 RGBD videos with high diversity, including deformable objects, various occlusion conditions and moving cameras. We propose a very simple but strong baseline model for RGBD tracking, and present a quantitative comparison of several state-of-the-art tracking algorithms. Experimental results show that including depth information and reasoning about occlusion significantly improves tracking performance. The datasets, evaluation details, source code for the baseline algorithm, and instructions for submitting new models will be made available online after acceptance.

Similar resources

Depth Masked Discriminative Correlation Filter

Depth information provides a strong cue for occlusion detection and handling, but has been largely omitted in generic object tracking until recently due to lack of suitable benchmark datasets and applications. In this work, we propose a Depth Masked Discriminative Correlation Filter (DM-DCF) which adopts novel depth segmentation based occlusion detection that stops correlation filter updating a...

Evaluating Appearance Models for Recognition, Reacquisition, and Tracking

Traditionally, appearance models for recognition, reacquisition and tracking problems have been evaluated independently using metrics applied to a complete system. It is shown that appearance models for these three problems can be evaluated using a cumulative matching curve on a standardized dataset, and that this one curve can be converted to a synthetic disambiguation rate for single camera t...

Research on facial features detection using RGBD camera

Facial feature detection is a very active research direction in computer vision. It is widely used in many computer vision applications, such as animation, face recognition, expression analysis and transfer. The RGBD camera is a new input device that provides both depth and RGB images. Several new methods have been proposed based on the RGBD camera. In this paper, we review the works of facial feat...

Large Scale 3D Mapping of Indoor Environments Using a Handheld RGBD Camera

The goal of this research is to investigate the problem of reconstructing a 3D representation of an environment, of arbitrary size, using a handheld color and depth (RGBD) sensor. The focus of this dissertation is to examine four of the underlying subproblems to this system: camera tracking, loop closure, data storage, and integration. First, a system for 3D reconstruction of large indoor plana...

Efficient Onboard RGBD-SLAM for Autonomous MAVs

We present a computationally inexpensive RGBD-SLAM solution tailored to the application on autonomous MAVs, which enables our MAV to fly in an unknown environment and create a map of its surroundings completely autonomously, with all computations running on its onboard computer. We achieve this by implementing efficient methods for both tracking its current location with respect to a heavily pro...

Journal title:
  • CoRR

Volume abs/1212.2823

Publication date 2012